Overview

Dataset statistics

Number of variables21
Number of observations13580
Missing cells13256
Missing cells (%)4.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 MiB
Average record size in memory168.0 B

Variable types

NUM13
CAT8

Warnings

Suburb has a high cardinality: 314 distinct values High cardinality
Address has a high cardinality: 13378 distinct values High cardinality
SellerG has a high cardinality: 268 distinct values High cardinality
Date has a high cardinality: 58 distinct values High cardinality
Bedroom2 is highly correlated with RoomsHigh correlation
Rooms is highly correlated with Bedroom2High correlation
BuildingArea has 6450 (47.5%) missing values Missing
YearBuilt has 5375 (39.6%) missing values Missing
CouncilArea has 1369 (10.1%) missing values Missing
Landsize is highly skewed (γ1 = 95.23740045) Skewed
BuildingArea is highly skewed (γ1 = 77.69154092) Skewed
Address is uniformly distributed Uniform
Car has 1026 (7.6%) zeros Zeros
Landsize has 1939 (14.3%) zeros Zeros

Reproduction

Analysis started2020-10-08 17:28:56.101470
Analysis finished2020-10-08 17:29:29.010838
Duration32.91 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Suburb
Categorical

HIGH CARDINALITY

Distinct314
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
Reservoir
 
359
Richmond
 
260
Bentleigh East
 
249
Preston
 
239
Brunswick
 
222
Other values (309)
12251 
ValueCountFrequency (%) 
Reservoir3592.6%
 
Richmond2601.9%
 
Bentleigh East2491.8%
 
Preston2391.8%
 
Brunswick2221.6%
 
Essendon2201.6%
 
South Yarra2021.5%
 
Glen Iris1951.4%
 
Hawthorn1911.4%
 
Coburg1901.4%
 
Other values (304)1125382.9%
 
Frequencies of value counts

Unique

Unique21 ?
Unique (%)0.2%
Histogram of lengths of the category

Length

Max length18
Median length9
Mean length9.79646539
Min length3

Address
Categorical

HIGH CARDINALITY
UNIFORM

Distinct13378
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
36 Aberfeldie St
 
3
28 Blair St
 
3
53 William St
 
3
2 Bruce St
 
3
5 Charles St
 
3
Other values (13373)
13565 
ValueCountFrequency (%) 
36 Aberfeldie St3< 0.1%
 
28 Blair St3< 0.1%
 
53 William St3< 0.1%
 
2 Bruce St3< 0.1%
 
5 Charles St3< 0.1%
 
1/1 Clarendon St3< 0.1%
 
13 Robinson St3< 0.1%
 
5 Margaret St3< 0.1%
 
14 Arthur St3< 0.1%
 
2/13 Walker St2< 0.1%
 
Other values (13368)1355199.8%
 
Frequencies of value counts

Unique

Unique13185 ?
Unique (%)97.1%
Histogram of lengths of the category

Length

Max length27
Median length13
Mean length13.51045655
Min length8

Rooms
Real number (ℝ≥0)

HIGH CORRELATION

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.937997054
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q33
95-th percentile5
Maximum10
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9557479384
Coefficient of variation (CV)0.3253059553
Kurtosis0.7940679895
Mean2.937997054
Median Absolute Deviation (MAD)1
Skewness0.3764780328
Sum39898
Variance0.9134541218
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%) 
3588143.3%
 
2364826.9%
 
4268819.8%
 
16815.0%
 
55964.4%
 
6670.5%
 
7100.1%
 
880.1%
 
101< 0.1%
 
ValueCountFrequency (%) 
16815.0%
 
2364826.9%
 
3588143.3%
 
4268819.8%
 
55964.4%
 
ValueCountFrequency (%) 
101< 0.1%
 
880.1%
 
7100.1%
 
6670.5%
 
55964.4%
 

Type
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
h
9449 
u
3017 
t
1114 
ValueCountFrequency (%) 
h944969.6%
 
u301722.2%
 
t11148.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Price
Real number (ℝ≥0)

Distinct2204
Distinct (%)16.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1075684.079
Minimum85000
Maximum9000000
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum85000
5-th percentile405000
Q1650000
median903000
Q31330000
95-th percentile2290050
Maximum9000000
Range8915000
Interquartile range (IQR)680000

Descriptive statistics

Standard deviation639310.7243
Coefficient of variation (CV)0.5943294472
Kurtosis9.874338886
Mean1075684.079
Median Absolute Deviation (MAD)313000
Skewness2.239624313
Sum1.46077898e+10
Variance4.087182022e+11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
11000001130.8%
 
13000001090.8%
 
8000001090.8%
 
6500001090.8%
 
6000001040.8%
 
1000000970.7%
 
1200000970.7%
 
900000950.7%
 
700000910.7%
 
1400000890.7%
 
Other values (2194)1256792.5%
 
ValueCountFrequency (%) 
850001< 0.1%
 
1310001< 0.1%
 
1450002< 0.1%
 
1600001< 0.1%
 
1700002< 0.1%
 
ValueCountFrequency (%) 
90000001< 0.1%
 
80000001< 0.1%
 
76500001< 0.1%
 
65000001< 0.1%
 
64000001< 0.1%
 

Method
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
S
9022 
SP
1703 
PI
1564 
VB
1199 
SA
 
92
ValueCountFrequency (%) 
S902266.4%
 
SP170312.5%
 
PI156411.5%
 
VB11998.8%
 
SA920.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length1
Mean length1.335640648
Min length1

SellerG
Categorical

HIGH CARDINALITY

Distinct268
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
Nelson
1565 
Jellis
1316 
hockingstuart
1167 
Barry
1011 
Ray
 
701
Other values (263)
7820 
ValueCountFrequency (%) 
Nelson156511.5%
 
Jellis13169.7%
 
hockingstuart11678.6%
 
Barry10117.4%
 
Ray7015.2%
 
Marshall6594.9%
 
Buxton6324.7%
 
Biggin3932.9%
 
Brad3422.5%
 
Woodards3012.2%
 
Other values (258)549340.4%
 
Frequencies of value counts

Unique

Unique78 ?
Unique (%)0.6%
Histogram of lengths of the category

Length

Max length23
Median length6
Mean length6.402503682
Min length1

Date
Categorical

HIGH CARDINALITY

Distinct58
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
27/05/2017
 
473
3/06/2017
 
395
12/08/2017
 
387
17/06/2017
 
374
27/11/2016
 
362
Other values (53)
11589 
ValueCountFrequency (%) 
27/05/20174733.5%
 
3/06/20173952.9%
 
12/08/20173872.8%
 
17/06/20173742.8%
 
27/11/20163622.7%
 
29/07/20173412.5%
 
4/03/20173372.5%
 
25/02/20173332.5%
 
24/06/20173292.4%
 
10/12/20163192.3%
 
Other values (48)993073.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length10
Median length10
Mean length9.724815906
Min length9

Distance
Real number (ℝ≥0)

Distinct202
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.13777614
Minimum0
Maximum48.1
Zeros6
Zeros (%)< 0.1%
Memory size106.1 KiB

Quantile statistics

Minimum0
5-th percentile2.6
Q16.1
median9.2
Q313
95-th percentile20.6
Maximum48.1
Range48.1
Interquartile range (IQR)6.9

Descriptive statistics

Standard deviation5.868724943
Coefficient of variation (CV)0.5788966792
Kurtosis5.260001109
Mean10.13777614
Median Absolute Deviation (MAD)3.35
Skewness1.676937083
Sum137671
Variance34.44193246
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
11.27395.4%
 
9.23672.7%
 
13.93242.4%
 
7.83062.3%
 
4.62631.9%
 
132521.9%
 
5.22481.8%
 
82481.8%
 
13.82371.7%
 
2.62351.7%
 
Other values (192)1036176.3%
 
ValueCountFrequency (%) 
06< 0.1%
 
0.780.1%
 
1.2330.2%
 
1.35< 0.1%
 
1.5170.1%
 
ValueCountFrequency (%) 
48.11< 0.1%
 
47.41< 0.1%
 
47.33< 0.1%
 
45.990.1%
 
45.21< 0.1%
 

Postcode
Real number (ℝ≥0)

Distinct198
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3105.301915
Minimum3000
Maximum3977
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum3000
5-th percentile3013
Q13044
median3084
Q33148
95-th percentile3204
Maximum3977
Range977
Interquartile range (IQR)104

Descriptive statistics

Standard deviation90.67696409
Coefficient of variation (CV)0.02920069178
Kurtosis29.15686787
Mean3105.301915
Median Absolute Deviation (MAD)50
Skewness4.076152215
Sum42170000
Variance8222.311816
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
30733592.6%
 
30203062.3%
 
31212922.2%
 
30402902.1%
 
30462842.1%
 
31652491.8%
 
30582461.8%
 
31632451.8%
 
30122421.8%
 
30722391.8%
 
Other values (188)1082879.7%
 
ValueCountFrequency (%) 
3000460.3%
 
3002220.2%
 
3003310.2%
 
3006410.3%
 
30083< 0.1%
 
ValueCountFrequency (%) 
397780.1%
 
39764< 0.1%
 
39106< 0.1%
 
38103< 0.1%
 
38091< 0.1%
 

Bedroom2
Real number (ℝ≥0)

HIGH CORRELATION

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.914727541
Minimum0
Maximum20
Zeros16
Zeros (%)0.1%
Memory size106.1 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile4
Maximum20
Range20
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9659210617
Coefficient of variation (CV)0.33139326
Kurtosis8.074963808
Mean2.914727541
Median Absolute Deviation (MAD)1
Skewness0.7740822106
Sum39582
Variance0.9330034975
MonotocityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%) 
3589643.4%
 
2373727.5%
 
4260119.2%
 
16915.1%
 
55564.1%
 
6630.5%
 
0160.1%
 
7100.1%
 
85< 0.1%
 
93< 0.1%
 
Other values (2)2< 0.1%
 
ValueCountFrequency (%) 
0160.1%
 
16915.1%
 
2373727.5%
 
3589643.4%
 
4260119.2%
 
ValueCountFrequency (%) 
201< 0.1%
 
101< 0.1%
 
93< 0.1%
 
85< 0.1%
 
7100.1%
 

Bathroom
Real number (ℝ≥0)

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.534241532
Minimum0
Maximum8
Zeros34
Zeros (%)0.3%
Memory size106.1 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile3
Maximum8
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.6917117225
Coefficient of variation (CV)0.4508493012
Kurtosis3.594973134
Mean1.534241532
Median Absolute Deviation (MAD)0
Skewness1.377405972
Sum20835
Variance0.478465107
MonotocityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%) 
1751255.3%
 
2497436.6%
 
39176.8%
 
41060.8%
 
0340.3%
 
5280.2%
 
65< 0.1%
 
82< 0.1%
 
72< 0.1%
 
ValueCountFrequency (%) 
0340.3%
 
1751255.3%
 
2497436.6%
 
39176.8%
 
41060.8%
 
ValueCountFrequency (%) 
82< 0.1%
 
72< 0.1%
 
65< 0.1%
 
5280.2%
 
41060.8%
 

Car
Real number (ℝ≥0)

ZEROS

Distinct11
Distinct (%)0.1%
Missing62
Missing (%)0.5%
Infinite0
Infinite (%)0.0%
Mean1.610075455
Minimum0
Maximum10
Zeros1026
Zeros (%)7.6%
Memory size106.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q32
95-th percentile3
Maximum10
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9626335192
Coefficient of variation (CV)0.5978809976
Kurtosis5.193182788
Mean1.610075455
Median Absolute Deviation (MAD)1
Skewness1.369675926
Sum21765
Variance0.9266632924
MonotocityNot monotonic
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%) 
2559141.2%
 
1550940.6%
 
010267.6%
 
37485.5%
 
45063.7%
 
5630.5%
 
6540.4%
 
890.1%
 
780.1%
 
103< 0.1%
 
(Missing)620.5%
 
ValueCountFrequency (%) 
010267.6%
 
1550940.6%
 
2559141.2%
 
37485.5%
 
45063.7%
 
ValueCountFrequency (%) 
103< 0.1%
 
91< 0.1%
 
890.1%
 
780.1%
 
6540.4%
 

Landsize
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct1448
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean558.4161267
Minimum0
Maximum433014
Zeros1939
Zeros (%)14.3%
Memory size106.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1177
median440
Q3651
95-th percentile995
Maximum433014
Range433014
Interquartile range (IQR)474

Descriptive statistics

Standard deviation3990.669241
Coefficient of variation (CV)7.146407581
Kurtosis10180.34683
Mean558.4161267
Median Absolute Deviation (MAD)236
Skewness95.23740045
Sum7583291
Variance15925440.99
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0193914.3%
 
6501030.8%
 
697710.5%
 
700480.4%
 
585470.3%
 
534420.3%
 
590390.3%
 
696360.3%
 
649360.3%
 
603350.3%
 
Other values (1438)1118482.4%
 
ValueCountFrequency (%) 
0193914.3%
 
12< 0.1%
 
21< 0.1%
 
31< 0.1%
 
51< 0.1%
 
ValueCountFrequency (%) 
4330141< 0.1%
 
760001< 0.1%
 
751001< 0.1%
 
445001< 0.1%
 
414001< 0.1%
 

BuildingArea
Real number (ℝ≥0)

MISSING
SKEWED

Distinct602
Distinct (%)8.4%
Missing6450
Missing (%)47.5%
Infinite0
Infinite (%)0.0%
Mean151.9676499
Minimum0
Maximum44515
Zeros17
Zeros (%)0.1%
Memory size106.1 KiB

Quantile statistics

Minimum0
5-th percentile51
Q193
median126
Q3174
95-th percentile294
Maximum44515
Range44515
Interquartile range (IQR)81

Descriptive statistics

Standard deviation541.0145376
Coefficient of variation (CV)3.560063856
Kurtosis6347.802222
Mean151.9676499
Median Absolute Deviation (MAD)39
Skewness77.69154092
Sum1083529.344
Variance292696.7299
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1201140.8%
 
110890.7%
 
100880.6%
 
130840.6%
 
115770.6%
 
150740.5%
 
104660.5%
 
90650.5%
 
140640.5%
 
112630.5%
 
Other values (592)634646.7%
 
(Missing)645047.5%
 
ValueCountFrequency (%) 
0170.1%
 
1110.1%
 
2160.1%
 
3200.1%
 
44< 0.1%
 
ValueCountFrequency (%) 
445151< 0.1%
 
67911< 0.1%
 
35581< 0.1%
 
31121< 0.1%
 
15611< 0.1%
 

YearBuilt
Real number (ℝ≥0)

MISSING

Distinct144
Distinct (%)1.8%
Missing5375
Missing (%)39.6%
Infinite0
Infinite (%)0.0%
Mean1964.684217
Minimum1196
Maximum2018
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum1196
5-th percentile1900
Q11940
median1970
Q31999
95-th percentile2012
Maximum2018
Range822
Interquartile range (IQR)59

Descriptive statistics

Standard deviation37.27376222
Coefficient of variation (CV)0.01897188459
Kurtosis21.22603222
Mean1964.684217
Median Absolute Deviation (MAD)30
Skewness-1.54127876
Sum16120234
Variance1389.33335
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
19708666.4%
 
19607255.3%
 
19505804.3%
 
19003412.5%
 
19803382.5%
 
20003002.2%
 
19202802.1%
 
19302742.0%
 
19102401.8%
 
19402381.8%
 
Other values (134)402329.6%
 
(Missing)537539.6%
 
ValueCountFrequency (%) 
11961< 0.1%
 
18301< 0.1%
 
18504< 0.1%
 
18541< 0.1%
 
18561< 0.1%
 
ValueCountFrequency (%) 
20181< 0.1%
 
2017180.1%
 
2016580.4%
 
2015650.5%
 
20141000.7%
 

CouncilArea
Categorical

MISSING

Distinct33
Distinct (%)0.3%
Missing1369
Missing (%)10.1%
Memory size106.1 KiB
Moreland
1163 
Boroondara
1160 
Moonee Valley
997 
Darebin
934 
Glen Eira
848 
Other values (28)
7109 
ValueCountFrequency (%) 
Moreland11638.6%
 
Boroondara11608.5%
 
Moonee Valley9977.3%
 
Darebin9346.9%
 
Glen Eira8486.2%
 
Stonnington7195.3%
 
Maribyrnong6925.1%
 
Yarra6474.8%
 
Port Phillip6284.6%
 
Banyule5944.4%
 
Other values (23)382928.2%
 
(Missing)136910.1%
 
Frequencies of value counts

Unique

Unique2 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length17
Median length9
Mean length8.457437408
Min length3

Lattitude
Real number (ℝ)

Distinct6503
Distinct (%)47.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-37.80920273
Minimum-38.18255
Maximum-37.40853
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum-38.18255
5-th percentile-37.9348
Q1-37.8568225
median-37.802355
Q3-37.7564
95-th percentile-37.6989385
Maximum-37.40853
Range0.77402
Interquartile range (IQR)0.1004225

Descriptive statistics

Standard deviation0.0792598226
Coefficient of variation (CV)-0.002096310339
Kurtosis1.573252695
Mean-37.80920273
Median Absolute Deviation (MAD)0.050455
Skewness-0.4266949343
Sum-513448.9731
Variance0.006282119479
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
-37.8361210.2%
 
-37.7969160.1%
 
-37.8424160.1%
 
-37.7609140.1%
 
-37.8161130.1%
 
-37.8414130.1%
 
-37.7679130.1%
 
-37.7634130.1%
 
-37.8573130.1%
 
-37.8198130.1%
 
Other values (6493)1343598.9%
 
ValueCountFrequency (%) 
-38.182551< 0.1%
 
-38.174881< 0.1%
 
-38.168021< 0.1%
 
-38.167621< 0.1%
 
-38.166241< 0.1%
 
ValueCountFrequency (%) 
-37.408531< 0.1%
 
-37.453921< 0.1%
 
-37.457091< 0.1%
 
-37.483811< 0.1%
 
-37.487011< 0.1%
 

Longtitude
Real number (ℝ≥0)

Distinct7063
Distinct (%)52.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean144.9952162
Minimum144.43181
Maximum145.52635
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum144.43181
5-th percentile144.835785
Q1144.9296
median145.0001
Q3145.058305
95-th percentile145.153631
Maximum145.52635
Range1.09454
Interquartile range (IQR)0.128705

Descriptive statistics

Standard deviation0.1039155614
Coefficient of variation (CV)0.0007166826888
Kurtosis1.758615585
Mean144.9952162
Median Absolute Deviation (MAD)0.063415
Skewness-0.2109908954
Sum1969035.036
Variance0.0107984439
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
144.9966170.1%
 
145.0104150.1%
 
144.985140.1%
 
145.0001130.1%
 
144.991130.1%
 
145.0243120.1%
 
145.021120.1%
 
144.997120.1%
 
145.0043120.1%
 
145.0116120.1%
 
Other values (7053)1344899.0%
 
ValueCountFrequency (%) 
144.431811< 0.1%
 
144.485711< 0.1%
 
144.542371< 0.1%
 
144.545321< 0.1%
 
144.551061< 0.1%
 
ValueCountFrequency (%) 
145.526351< 0.1%
 
145.482731< 0.1%
 
145.470521< 0.1%
 
145.453761< 0.1%
 
145.44531< 0.1%
 

Regionname
Categorical

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size106.1 KiB
Southern Metropolitan
4695 
Northern Metropolitan
3890 
Western Metropolitan
2948 
Eastern Metropolitan
1471 
South-Eastern Metropolitan
 
450
Other values (3)
 
126
ValueCountFrequency (%) 
Southern Metropolitan469534.6%
 
Northern Metropolitan389028.6%
 
Western Metropolitan294821.7%
 
Eastern Metropolitan147110.8%
 
South-Eastern Metropolitan4503.3%
 
Eastern Victoria530.4%
 
Northern Victoria410.3%
 
Western Victoria320.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length26
Median length21
Mean length20.79690722
Min length16

Propertycount
Real number (ℝ≥0)

Distinct311
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7454.417378
Minimum249
Maximum21650
Zeros0
Zeros (%)0.0%
Memory size106.1 KiB

Quantile statistics

Minimum249
5-th percentile2185
Q14380
median6555
Q310331
95-th percentile14949
Maximum21650
Range21401
Interquartile range (IQR)5951

Descriptive statistics

Standard deviation4378.581772
Coefficient of variation (CV)0.5873808172
Kurtosis1.217820011
Mean7454.417378
Median Absolute Deviation (MAD)2695.5
Skewness1.069339349
Sum101230988
Variance19171978.33
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
216503592.6%
 
88702982.2%
 
149492601.9%
 
109692491.8%
 
145772391.8%
 
119182221.6%
 
92642201.6%
 
148872021.5%
 
104121951.4%
 
113081911.4%
 
Other values (301)1114582.1%
 
ValueCountFrequency (%) 
2491< 0.1%
 
3896< 0.1%
 
3942< 0.1%
 
43870.1%
 
4572< 0.1%
 
ValueCountFrequency (%) 
216503592.6%
 
17496460.3%
 
173843< 0.1%
 
17093130.1%
 
17055240.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

SuburbAddressRoomsTypePriceMethodSellerGDateDistancePostcodeBedroom2BathroomCarLandsizeBuildingAreaYearBuiltCouncilAreaLattitudeLongtitudeRegionnamePropertycount
0Abbotsford85 Turner St2h1480000.0SBiggin3/12/20162.53067.02.01.01.0202.0NaNNaNYarra-37.7996144.9984Northern Metropolitan4019.0
1Abbotsford25 Bloomburg St2h1035000.0SBiggin4/02/20162.53067.02.01.00.0156.079.01900.0Yarra-37.8079144.9934Northern Metropolitan4019.0
2Abbotsford5 Charles St3h1465000.0SPBiggin4/03/20172.53067.03.02.00.0134.0150.01900.0Yarra-37.8093144.9944Northern Metropolitan4019.0
3Abbotsford40 Federation La3h850000.0PIBiggin4/03/20172.53067.03.02.01.094.0NaNNaNYarra-37.7969144.9969Northern Metropolitan4019.0
4Abbotsford55a Park St4h1600000.0VBNelson4/06/20162.53067.03.01.02.0120.0142.02014.0Yarra-37.8072144.9941Northern Metropolitan4019.0
5Abbotsford129 Charles St2h941000.0SJellis7/05/20162.53067.02.01.00.0181.0NaNNaNYarra-37.8041144.9953Northern Metropolitan4019.0
6Abbotsford124 Yarra St3h1876000.0SNelson7/05/20162.53067.04.02.00.0245.0210.01910.0Yarra-37.8024144.9993Northern Metropolitan4019.0
7Abbotsford98 Charles St2h1636000.0SNelson8/10/20162.53067.02.01.02.0256.0107.01890.0Yarra-37.8060144.9954Northern Metropolitan4019.0
8Abbotsford6/241 Nicholson St1u300000.0SBiggin8/10/20162.53067.01.01.01.00.0NaNNaNYarra-37.8008144.9973Northern Metropolitan4019.0
9Abbotsford10 Valiant St2h1097000.0SBiggin8/10/20162.53067.03.01.02.0220.075.01900.0Yarra-37.8010144.9989Northern Metropolitan4019.0

Last rows

SuburbAddressRoomsTypePriceMethodSellerGDateDistancePostcodeBedroom2BathroomCarLandsizeBuildingAreaYearBuiltCouncilAreaLattitudeLongtitudeRegionnamePropertycount
13570Wantirna South34 Fewster Dr3h970000.0SBarry26/08/201714.73152.03.02.02.0674.0NaNNaNNaN-37.88360145.22805Eastern Metropolitan7082.0
13571Wantirna South15 Mara Cl4h1330000.0SBarry26/08/201714.73152.04.02.02.0717.0191.01980.0NaN-37.86887145.22116Eastern Metropolitan7082.0
13572Watsonia76 Kenmare St2h650000.0PIMorrison26/08/201714.53087.02.01.01.0210.079.02006.0NaN-37.70657145.07878Northern Metropolitan2329.0
13573Werribee5 Nuragi Ct4h635000.0Shockingstuart26/08/201714.73030.04.02.01.0662.0172.01980.0NaN-37.89327144.64789Western Metropolitan16166.0
13574Westmeadows9 Black St3h582000.0SRed26/08/201716.53049.03.02.02.0256.0NaNNaNNaN-37.67917144.89390Northern Metropolitan2474.0
13575Wheelers Hill12 Strada Cr4h1245000.0SBarry26/08/201716.73150.04.02.02.0652.0NaN1981.0NaN-37.90562145.16761South-Eastern Metropolitan7392.0
13576Williamstown77 Merrett Dr3h1031000.0SPWilliams26/08/20176.83016.03.02.02.0333.0133.01995.0NaN-37.85927144.87904Western Metropolitan6380.0
13577Williamstown83 Power St3h1170000.0SRaine26/08/20176.83016.03.02.04.0436.0NaN1997.0NaN-37.85274144.88738Western Metropolitan6380.0
13578Williamstown96 Verdon St4h2500000.0PISweeney26/08/20176.83016.04.01.05.0866.0157.01920.0NaN-37.85908144.89299Western Metropolitan6380.0
13579Yarraville6 Agnes St4h1285000.0SPVillage26/08/20176.33013.04.01.01.0362.0112.01920.0NaN-37.81188144.88449Western Metropolitan6543.0